Channel and noise normalization using affine transformed cepstrum
نویسندگان
چکیده
This paper addresses the environmental mismatch problem that arises from noise and channel variabilities. A new feature mapping technique based on an optimal a ne transform of the cepstrum is proposed to solve the mismatch problem observed over the speaker recognition systems. It is designed based on the fact that both the channel and noise interferences basically cause the cepstrum space to undergo an a ne transformation. By taking an inverse transformation, we can easily decouple from the speech the e ects of the channel and noise. Alternatively, we can take a forward transform of the training data to simulate the operating conditions.
منابع مشابه
Blind normalization of speech from different channels
We show how to construct a channel-independent representation of speech that has propagated through a noisy reverberant channel. This is done by blindly rescaling the cepstral time series by a nonlinear function, with the form of this scale function being determined by previously encountered cepstra from that channel. The rescaled form of the time series is an invariant property of it in the fo...
متن کاملChannel compensation of modulation spectral features
We propose a new channel compensation method for modulation spectral features. We compare our proposed method, subband normalization, with a more traditional method, cepstral mean subtraction (CMS). Experimental results show that subband normalized modulation scale features provide advantages over CMS features. The proposed method is not only robust to slowly varying convolutional noise, but al...
متن کاملAutomatic Segmentation of Speech Recorded inUnknown Noisy Channel
This paper investigates the problem of automatic segmentation of speech recorded in noisy channel corrupted environments. Using an HMM-based speech segmentation algorithm, speech enhancement and parameter compensation techniques previously proposed for robust speech recognition are evaluated and compared for improved segmentation in colored noise. Speech enhancement algorithms considered includ...
متن کاملNoise and Channel Normalized Cepstral Features for Far-speech Recognition
The paper analyses suitable features for distorted speech recognition. The aim is to explore the application of command ASR system when the speech is recorded with far-distance microphones with a possible strong additive and convolutory noise. The paper analyses feasible contribution of basic spectral subtraction coupled with cepstral mean normalization in minimizing of the influence of present...
متن کاملMaximum likelihood normalization for robust speech recognition
It is well-known that additive and channel noise cause shift and scaling in MFCC features. Empirical normalization techniques to estimate and compensate for the effects, such as cepstral mean subtraction and variance normalization, have been shown to be useful. However, these empirical estimate may not be optimal. In this paper, we approach the problem from two directions, 1) use a more robust ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996